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1. Introduction 



The family A^^ of /c- monotone densities on is a mixture model defined by 
the basic density 170(2^) = — a;)^~^. That means that every density / G A4k 
is a multiplicative convolution of some mixing probability distribution G and 
50: 











Jo V 





This class generalizes classes of monotone (1— monotone) and convex decreas- 
ing (2-monotone) densities on As k increases, the classes A4k decrease and 
the class of completely monotone densities lies in the intersection of all Aik- 
Balabdaoui and Wellner [2009] proves consistency and Balabdaoui and Wellner 
[2007] gives asymptotic limit theory for the MLE of a fc— monotone density. 
For fc = 1 the uniqueness of the MLE follows from a well known character- 
ization of the Grenander estimator, for k = 2 the uniqueness was proved in 
Groeneboom et al. [2001] and for = 3 in Balabdaoui [2004]. In JeweU [1982] 
the uniqueness and weak consistency was proved for the MLE of a completely 
monotone density. In Lindsay and Roeder [1993] a general method was devel- 
oped which allows to prove uniqueness of the MLE in mixture models defined 
by strictly totally positive kernels such as the exponential kernel for completely 
monotone densities. However, for fc— monotone densities the kernel is not strictly 
totally positive as follows from Schoenberg and Whitney [1953] and uniqueness 
of the MLE for fc > 3 has not been investigated before. Below, we answer this 
question proving the following theorem: 
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Theorem 1.1. Let k > 2 be a positive integer and Xi, . . . , Xn he i.i.d. random 
variables with k-monotone density fo G Aik- With probability 1 the MLE /„ is 
uniquely defined by some discrete mixing probability measure Gn ■ The cardinality 
of the support set supp(G'„) is less or equal than n. If supp(G„) consists of 
points Yi < ■■■ < Yj-n, m < n then there exists a subset of order statistics 
■^{ii) < • • ■ < such that: 

where we assume X{ii) = +oo for I > m. Also we have Y^ > 
2. Proof 

With probabihty 1 we have that the order statistics are distinct: 

< < •■• < X(„). 

We assume that = +oo for I > n. 

The proof consists of several lemmas. First lemma provides a tool for counting 
zeroes of a function. 

Lemma 2.1. Let A C 'E. be an interval. Consider a function f G C'^{A). Then 
the following inequality is true for the number of zeroes N{f, A) of f counted 
with multiplicities: 

iV(/W,A) >iV(/,A)-fc. 

Here the multiplicity of zero x of the function f G C''{A), k > 0, x G A is an 
integer n < k + 1 such that: 

f{y) = --- = &-'\y) = o 

and either n = k + 1 or f^^\y) ^ 0. 

Proof. Let Xi, 1 < i < m be zeroes of / on A and be its multiplicities. Then 
by RoUe's theorem the derivative /' G C'^~^{A) has m — 1 zeroes between points 
Xi with multiplicities at least 1 and also zeroes Xi with multiplicities n^ — 1. Thus: 

VI m 

N{f', A)>m-1 + Y,{n, - 1) = -1 + ^ = N{f, A) - 1. 

1=1 1=1 

Applying this inequality k times we obtain the result. □ 

Proof of the second lemma builds the necessary tools for the main result. 
Note, that discreteness of the mixing measure Gn actually follows from gen- 
eral results about the MLE for mixture models proved in Lindsay [1983a]. We 
repeat a part of that argument to make our proof self-contained and also to 
provide references to the classic convex analysis textbook Rockafellar [1970] for 
an interested reader. 
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Lemma 2.2. There exists a set Y of points Yi < • • • < Ym, m < n such that 
k— monotone density /„ defined by a mixing probability measure Gn is the MLE 
if and only if supp^Gn) CY. In other words, any MLE has the form: 

Ux)=±a, ''^''^-f" , (2.1) 

where Oj > 0, ^o,j ^ 1 and Yj > X(^jy Moreover, the vector (/n(^(i)))"=i is 
unique. 

Proof. Following Lindsay [1983a, 1995] wc define a curve F parameterized by 
y e M+ as: 

Since go{x/y) as a function of y is bounded and equals zero at and +oo we have 
that F is compact. Suppose that f G A4k with corresponding mixing probability 
measure G. Then the vector belongs to the set f ~ cl(conv(F)): 



(/(^W))r=i = / ny)dG{y) 
Jo 

Since F is compact it follows that F is also compact and F = conv(F) by 
(Rockafellar [1970], Theorem 17.2). Therefore, the continuous function 0"=! 
attains its maximum on F, where Zi are the coordinates in the space M". Let 
us denote 5* = argmaxj=, HiLi ^i- 

Since the intersection of F and the interior ri(M" ) of R" is not empty we 
have S C ri(R" ). The function 11"=! strictly concave, therefore S consists 
of a single point b = (6i)"^i > 0. Therefore for any MLE /„ it follows that the 
vector {fniX(^i-)))"^i is equal to b and unique. The gradient of YYi=i -^i b is 
proportional to l/b= (1/6,)"^;^. 

We have dim(r) = n. Indeed if we consider n points ti E Xjj^^)) then 

the vectors T{ti) are linearly independent (Schoenberg and Whitney [1953] or 
direct observation). By (Rockafellar [1970], Theorem 27.4) the vector 1/5 belongs 
to the normal cone of F at b. Since 1/b > we have b E dT and the plane a 
defined by the equation "^2^=1 ^i/^i = is a support plane of F at b. Thus for 
Vi = k/nbi we have: 

n 

p(y)^/-^i,,(y-X(,))^-i >0, 

i=l 

for all y > and p{y) = if and only if y = or T{y) G a. Let us denote by Y 
the set of y such that T{y) g a i.e. T{Y) = a n F. 

The intersection a n F is an exposed face (Rockafellar [1970], p. 162) of F. 
By Theorem 18.3 from Rockafellar [1970] we have a n f = conv(F(y)) and by 
Theorem 18.1 we have supp(G„) C Y. 
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The function p{y) belongs to C^^^"^^ at y = it has a zero of muhiphcity 
fc — 1 and at each y it has a zero of multiplicity greater or equal than 2. By 
Lemma 2.1 we have: 

A^(p('=-2), [0,X(,)]) > N{p, [0,X(,)]) - (fc - 2) > 2#(r n [0,X(,)]) + 1. 

The function pC''"^) jg piecewise quadratic and therefore on each interval 
it may have at most two zeroes and on [0,X(i)) it has only one 
zero at zero. Thus: 

1 + 2(j - 1) > iV(p('=-2), [0, > 2#(y n [0, + 1 

or equivalently the set Y has no more than j — 1 points on [0, -^(j)] • In particular, 
the set Y contains no more than n points. Let Yi < ■ • ■ < Y,n be the points of 
Y . Then we have Yj > ^(j)- 

This implies that for any MLE /„ the support of the corresponding mixing 
measure G„ is a subset of Y and thus any MLE /„ has the form: 

j=i 

where aj > 0, — ^- ^ 

The last lemma proves uniqueness of the MLE. 

Lemma 2.3. The discrete mixing probability measure Gn which defines an MLE 
is unique. 

Proof. Suppose there exist two different MLEs and ff^. Then by Lemma 2.2 
we have: 

I k{Yj — x)_^_ 



j = l 3 



and therefore the function q{x) defined by: 

where Sj = aj — admits at least n zeroes X^^^ and not all Sj are equal to 
zero. Therefore, by Lemma 2.1: 

N{q^''-'\[0,X,))>Niq, [0,X,))-(fc-2) >z-l-(fc-2). 

The function is piecewise linear with knots at Yj . Therefore, N{q^'^~^'> , [0, X;)) < 

#{yn[0,X(,))} + l and: 

#{yn[o,X(,))}>z-fc. 

This implies: 
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and by Lemma 2.2 



Now, by Corollary 1 of Schoenberg and Whitney [1953] we have c\ci{{Yi~- X (^j-^)^)'; 
0. Thus the vectors r(Fi), 1 <i <m are independent and all — 0. This eon- 
tradietion completes the proof. □ 



,m 



> 



Note, that from Lemma 2.2 it follows that any mixing probability measure 
on a finite set Y = {Yj} defines MLE. Lemma 2.3 shows that the MLE is unique 
and hence the vectors ^(Yj) are independent. In particular, there exists a subset 
of order statistics ^(i^) < • • • < ^(i„,) such that: 



det((y,-X(,))5_-i)™^^>0. 




The equality (2.1) also implies that for any MLE y„j > since otherwise 

the last coordinate of all vectors r(Yj) is equal to zero and fn{X^n)) = and 
this completes the proof of Theorem 1.1. 
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